Progress Report: Predicting Which Recommended Content Users Click
نویسنده
چکیده
Machine learning models can be used to predict which recommended content users will click on a given website. The given dataset contains millions of samples that map some feature about an ad or web page to a number. We reduced this dataset to a more manageable size to minimize computation time, and we extract features based on this reduced set. The features we extracted are based on the advertisers and campaigns associated with the advertisements in the dataset. We initially built models based on Naive Bayes and logistic regression. We also built a model based on the support vector machine (SVM) using hinge loss. In addition, we constructed a neural network using the multilayer perceptron model to capture the non-linearity of features in order to obtain a better prediction score. The best result we obtained was using SVMs, and it yielded an accuracy of 0.46. The result can be scaled up to the complete dataset by optimizing our implementation using parallel computing.
منابع مشابه
An Ensemble Click Model for Web Document Ranking
Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantageous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...
متن کاملOptimization in Online Content Recommendation Services: Beyond Click-Through Rates
A class of online services allows Internet media sites to direct users from articles they are currently reading to other content they may be interested in. This process creates a “browsing path” along which there is potential for repeated interaction between the user and the provider, giving rise to a dynamic optimization problem. A key metric that often underlies this recommendation process is...
متن کاملEffective Learning to Rank Persian Web Content
Persian language is one of the most widely used languages in the Web environment. Hence, the Persian Web includes invaluable information that is required to be retrieved effectively. Similar to other languages, ranking algorithms for the Persian Web content, deal with different challenges, such as applicability issues in real-world situations as well as the lack of user modeling. CF-Rank, as a ...
متن کاملIncorporating Non-sequential interactions into Click Models
Click-through information is considered as a valuable source of users’ implicit relevance feedback. As user behavior is usually influenced by a number of factors such as position, presentation style and site reputation, researchers have proposed a variety of assumptions (i.e. click models) to generate a reasonable estimation of result relevance. The construction of click models usually follow s...
متن کاملThe Big Picture: Search and Discovery
These two types of retrieval systems have in common that they can be incredibly complex under the hood. The results they provide may depend not only on the content of the query and the items being retrieved, but also on the collective behavior of the system’s users. For example, how and what movies you rate on Netflix will influence what movies are recommended to other users, and on Amazon, rev...
متن کامل